Maitri Demonstration: Managing Large Scale Scientific Data (Demo)

نویسندگان

  • Rishi Rakesh Sinha
  • Arash Termehchy
  • Soumyadeb Mitra
  • Marianne Winslett
چکیده

Even traditional commercial database systems do not scale to the size of today’s large scientific data sets, whose growth is outpacing Moore’s Law. Instead, scientists are wedded to special-purpose data formats and their associated I/O libraries, even though these libraries provide only basic functionality. Thus there is a need for a scalable data management system that can support these formats and, when needed, provide more sophisticated functionality for indexing, buffering, caching, concurrency control, metadata management, and querying. This demonstration showcases Maitri, a framework that can be used to address these needs. The Maitri framework consists of a set of standard, very narrow interfaces for format-agnostic, loosely-coupled libraries offering aspects of the functionality listed above. Format independence is provided by Maitri’s block manager module, which encapsulates all code that is specific to a particular scientific data format, and calls the appropriate scientific I/O library to read and write data in that format. The demonstration shows the Rocketeer visualization toolkit running with subsets of the Maitri module implementations to visualize rocket simulation output data. The Rocketeer runs show the efficiency that can be gained by layering format-agnostic query, buffering, and/or indexing facilities atop scientific I/O libraries. The demonstration also shows that despite their narrow interfaces, the implementations of Maitri modules work together very effectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maitri: A Format-Independent Framework for Managing Large Scale Scientific Data

Even traditional commercial database systems do not scale to the size of today’s large scientific data sets, whose growth is outpacing Moore’s Law. Instead, scientists are wedded to special-purpose data formats and their associated I/O libraries, even though these libraries provide only basic functionality. Thus there is a need for a scalable data management system that can support these format...

متن کامل

Syntactic Structures of Sentences from Large Corpora

The demonstration will consist in displaying the syntactic structure of sentences from novels, scientific texts, newspapers, ... The syntactic structures are computed by our syntactic parser and the output shows in a human-friendly graphic interface (1) word features (as computed by POS tagger) (2) nonrecursive phrases (as computed by shallow parser) and (3) their relations (the functional stru...

متن کامل

AZDBLab: A Laboratory Information System for Large-Scale Empirical DBMS Studies

In the database field, while very strong mathematical and engineering work has been done, the scientific approach has been much less prominent. The deep understanding of query optimizers obtained through the scientific approach can lead to better engineered designs. Unlike other domains, there have been few DBMS-dedicated laboratories, focusing on such scientific investigation. In this demonstr...

متن کامل

Maitri: Format Independent Data Management for Scientific Data

Today’s scientific applications are very data intensive, and their data management requirements can no longer be met by special-purpose libraries for particular scientific data formats or by traditional database management systems. This paper proposes Maitri, a data-format-independent, loosely-coupled, application-tailorable set of libraries that provides a holistic data management framework fo...

متن کامل

Job Provenance - Insight into Very Large Provenance Datasets

Following the job-centric monitoring concept, Job Provenance (JP) service organizes provenance records on the per-job basis. It is designed to manage very large number of records, as was required in the EGEE project where it was developed originally. The quantitative aspect is also a focus of the presented demonstration. We show JP capability to retrieve data items of interest from a large data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007